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Abstract 


Bacteriophage Tg is a virus that infects Escherichia_coli, having dimensions of 90 nm in width and 200 nm in length 
(head and tail in extended form). [2] It is a quite common model organism that has been studied for a century by 
many important virologists, and even Watson and Crick after their elucidation of DNA. Structural characterisation 
of the bacteriophage’s individual proteins began in the 1980s,[2] and complexes of multiple proteins in the 
1990s.[3] However, it has not yet been possible to structurally characterise the complete phage in atomic detail 
(though some have begun to come closer)[4] with multiple overall schematic models published.[5]The increasing 
power of computers and the RCSB structural database have made possible the construction of a single combined 
model of the entire bacteriophage T4 organism with atomic resolution components as described here. 
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Introduction 


The complete structural model for bacteriophage Ty 
has been constructed thanks to the determination of 
the structures of single proteins that constitute the vi- 
rus as well as various parts of the virus. First, the capsid 
(head) of the virus was constructed using a 3D cryoEM 
reconstruction where each individual protein was fit- 
ted into the EM density: soc in orange, hoc in blue, 
gp23 in green, gp24 in magenta, and gp20 in red 
(though gp2o is hidden between the head and tail). A 
similar procedure was followed for the construction of 
the tail and tail fibers.[6][7] Although each of the com- 
ponent proteins has been described in detail during 
decades of research, this combined structural model, 
whilst not perfect, is the best reconstruction of the en- 
tire organism that we have today as of 2021. 


Description 
Further information: Wikipedia:Bacteriophage T4 


This reconstruction largely used the molecular visuali- 
zation software UCSF Chimera[8] in a personal com- 
puter, and where necessary for some tasks, in super- 
computers like Bridges for Pittsburgh and Frontera 
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from Texas. This combined structural model has been 
constructed and updated over time my work in Catholic 
University of America (where | started in 2007) as new 
structures of the components have been solved by 
other researchers, and therefore is the accumulated 
work of many years of research. 


There are approximately 50 structural proteins that as- 
semble the virus which is constructed with protein data- 
bank (PDB) structures and one cryoEM reconstruction 
from the Electron Microscopy Data Bank (EMDB) corre- 
sponding to the brown ring between head and tail. This 
structure can be used for teaching at any level of educa- 
tion and research because is accurate at atomic resolu- 
tion and therefore can be used to derive hypotheses 
(e.g., how antigen-display technology could be used 
with bacteriophage T4). The protein colours are chosen 
to differentiate between them and make contrast as 
well as showing art in science, but some colours are the 
same for different proteins (e.g., soc in the head and 
gp18 in the tail). As a reference for further publications, 
you can look at Bacteriophage T4 in Wikipedia where 
each protein is described in different research articles. 


List of PDB and EMDB structures 


e Head: hoc (PDB: 3shs - blue), soc (PDB: 5vf3 - or- 
ange), gp23 (PDB: svf3 - dark green), gp24 (PDB: 
5vf3 - dark magenta), gp20 (PDB: 6uzc - dark red) 


e — Collar: brown (EM density from EMDB: 1075) 
e Wac fibres: PDB: 2bsg - dark green 


WikiJournal of Science, 2021, 4(1):2 
doi: 10.15347/wjs/2021.[10.15347/WJS/2021.005] 
Review Article 


Figure 1 | Bacteriophage T4 Structural Model, Dr. Victor Padilla-Sanchez, PhD - Own work CC BY-SA 4.0 


e — Neck: brown gp23 and gp14 are represented by an 
EM density from EMDB: 1075 


e — Tail:[7] gp15 (PDB: 3j2m - orange), gp18 (PDB: 
3j2m - orange/red) 

e Tail base plate:[7](PDB:sivs-dark 
green/blue/dark red/purple) 


e Long Tail Fibers:[1] composites of gp37 (PDB: 
2xgf) and gp34 (PDB: 4uxf) repeated to form the 
long tail fibre. 


Note: same protein colours do not necessarily define 
same proteins. 
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